Similar resources
Data Compression Considering Text Files
Lossless text data compression is an important field as it significantly reduces storage requirement and communication cost. In this work, the focus is directed mainly to different file compression coding techniques and comparisons between them. Some memory efficient encoding schemes are analyzed and implemented in this work. They are: Shannon Fano Coding, Huffman Coding, Repeated Huffman Codin...
Transform Methods Used in Lossless Compression of Text Files
This paper presents a study of transform methods used in lossless text compression in order to preprocess the text by exploiting the inner redundancy of the source file. The transform methods are Burrows-Wheeler Transform (BWT, also known as Block Sorting), Star Transform and Length Index Preserving Transform (LIPT). BWT converts the original blocks of data into a format that is extremely well s...
Text comparison using data compression
Similarity detection is very important in fields such as spam detection, plagiarism detection, and topic detection. The main algorithm for comparing text documents is based on Kolmogorov complexity, which is an ideal measure of the similarity of two strings over a defined alphabet. Unfortunately, this measure is incomputable and we must define several approximations wh...
Data Compression Using Encrypted Text
In this paper, we present a new algorithm for text compression. The basic idea of our algorithm is to define a unique encryption or signature of each word in the dictionary by replacing certain characters in the words by a special character ‘*’ and retaining a few characters so that the word is still retrievable. For any encrypted text the most frequently used character is ‘*’ and the standard...
Compression of Unicode Files
The increasing importance of Unicode for text files, for example with Java and in some modern operating systems, implies a possible doubling of data storage space and data transmission time, with a corresponding need for data compression. However, it is not clear that data compressors designed for 8-bit byte data are well matched to 16-bit Unicode data. This paper investigates the compression of...
Journal
Journal title: International Journal of Computer Applications
Year: 2014
ISSN: 0975-8887
DOI: 10.5120/15765-4456